A General Framework for Bounding Approximate Dynamic Programming Schemes
نویسندگان
چکیده
For years, there has been interest in approximation methods for solving dynamic programming problems, because of the inherent complexity computing optimal solutions characterized by Bellman's principle optimality. A wide range approximate (ADP) now exists. It is great to guarantee that performance an ADP scheme be at least some known fraction, say ?, optimal. This letter introduces a general approach bounding methods, this sense, stochastic setting. The based on new results greedy string optimization where one choose (ordered set) actions maximize objective function. technique inspired submodularity theory, but not required establishing bounds. Instead, quantifying certain notions curvature functions; smaller curvatures better bound. key insight any surrogate function coincides its solution and value with those original control problem. then yields mentioned above, determine ? depend specific ADP.
منابع مشابه
Approximate Dynamic Programming in Transportation and Logistics: A Unified Framework
Deterministic optimization has enjoyed a rich place in transportation and logistics, where it represents a mature field with established modeling and algorithmic strategies. By contrast, sequential stochastic optimization models (dynamic programs) have been plagued by the lack of a common modeling framework, and by algorithmic strategies that just do not seem to scale to real-world problems in ...
متن کاملA General Framework for Dynamic Programming Equation and Time Consistency
Time consistency is an important property for a sequence of stochastic optimization problem, especially with regard to risk attitudes. It is known that a sequence of problem is time-consistent if a Dynamic Programming equation holds. In this paper we present a general framework for Dynamic Programming where general aggregators in time and uncertainties are considered instead of the expectation ...
متن کاملExpAX: A Framework for Automating Approximate Programming
We present ExpAX, a framework for automating approxi-mate programming. ExpAX consists of these three compo-nents: (1) a programming model based on a new kind of pro-gram specification, which we refer to as error expectations.Our programming model enables programmers to implicitlyrelax the accuracy constraints without explicitly marking op-erations as approximate; (2) an ...
متن کاملRecursion Schemes for Dynamic Programming
Dynamic programming is an algorithm design technique, which allows to improve efficiency by avoiding re-computation of identical subtasks. We present a new recursion combinator, dynamorphism, which captures the dynamic programming recursion pattern with memoization and identify some simple conditions when functions defined by structured general recursion can be redefined as a dynamorphism. The ...
متن کاملExpectation-Oriented Framework for Automating Approximate Programming
We describe ExpAX, a framework for automating approximate programming based on programmer-specified error expectations. Three components constitute ExpAX: (1) a programming model based on a new kind of program specification, which we refer to as expectations. Our programming model enables programmers to implicitly relax the accuracy constraints without explicitly marking operations approximate;...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Control Systems Letters
سال: 2021
ISSN: ['2475-1456']
DOI: https://doi.org/10.1109/lcsys.2020.3003477